Abstract


Solutions to the classic problems of dealing with heterogeneous data and making entire collections
interoperable, while supporting annotation (including the recognition-and-reward system of
scientific publishing), need to fit into a seamless beginning-to-end workflow to attract large
numbers of end users. The latest trend in Web applications encourages highly interactive Web sites
with rich user interfaces featuring content integrated from various sources around the Web. The
potential of RDF, SPARQL, and OWL to provide flexible data modeling, easier data integration, and
networked data access may be the answer to these classic problems. Using Semantic Web technologies,
we have created a Web application, the ConceptWiki, as an end-to-end solution for creating
browser-based read-write triples using RDF, which focuses on data integration and ease of use for
the end user. Here we demonstrate the integration of a biological data source, the ENZYME database,
into the ConceptWiki and its representation in RDF.
[Screenshot: ConceptWiki concept page for EC 1.1.1.1
(identifier 58856720-61ee-11df-b0cb-001517ac506c), with authority columns Community, Swiss-Prot
and ENZYME. Panels shown:
  Also referred to as: Alcohol dehydrogenase; Aldehyde reductase; EC 1.1.1.1
  Tags: amino acid, peptide or protein; enzyme commission number
  Functional information: has catalytic activity (An alcohol + NAD(+) = an aldehyde or ketone + NADH);
    has cofactor (Iron or zinc)
  Conceptual and other information: has miscellaneous information (Acts on primary or secondary
    alcohols or hemiacetals; The animal, but not the yeast, enzyme acts also on cyclic secondary alcohols)
  Reference: source Enzyme Commission numbers (EC 1.1.1.1); source ENZYME (EC 1.1.1.1)
  Pointers to EC 1.1.1.1
  Information in other websites: EC 1.1.1.1 in ENZYME, in UniProt, and in KEGG]

Figure 1: ConceptWiki 

The ConceptWiki (www.conceptwiki.org, Figure 1) is an open access repository of editable concepts.
It is a web-based wiki system that accepts an essentially unlimited number of synonyms, in multiple
languages, and maps all terms back to one unique concept identifier, alleviating problems of
vocabulary and identifier differences. Each concept in the ConceptWiki is annotated with one or more
semantic types and basic information such as a definition. Users can view and edit information
through a uniform interface. The information in the system is stored and edited in a highly
structured way, as triples (e.g. <concept> <has synonym> <term>). The ConceptWiki backend has been
designed to store concepts in a very generic form, so as to avoid, as far as possible, excluding
potentially valuable information sources. This compatibility with our other information storage
systems enables higher-level applications to easily query, summarize and mine the knowledge. In line
with recommendations from the Concept Web Alliance, identifiers in the ConceptWiki are completely
opaque: they have no inherent structure and no information can be derived from them. An opaque
identifier is a robust identifier, as there will never be a need to change the identifier when the
underlying information changes. The ConceptWiki uses Universally Unique Identifiers (UUIDs) as
identifiers.
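
The mapping of many terms onto one opaque identifier can be illustrated with a minimal Python
sketch. It is not the ConceptWiki backend: the class name ConceptStore, its methods, the
case-insensitive matching rule, and the choice of random (version 4) UUIDs are all assumptions made
for illustration.

```python
import uuid


class ConceptStore:
    """Toy in-memory store (hypothetical): many terms map to one opaque concept ID."""

    def __init__(self):
        self._labels = {}      # concept UUID -> set of terms as entered
        self._term_index = {}  # normalized term -> set of concept UUIDs

    @staticmethod
    def _normalize(term):
        # Assumed matching rule: ignore case and repeated whitespace.
        return " ".join(term.lower().split())

    def add_concept(self, *terms):
        # The identifier is opaque: nothing about the concept is encoded in it,
        # so it never has to change when the concept's information changes.
        concept_id = uuid.uuid4()
        self._labels[concept_id] = set()
        for term in terms:
            self.add_synonym(concept_id, term)
        return concept_id

    def add_synonym(self, concept_id, term):
        self._labels[concept_id].add(term)
        self._term_index.setdefault(self._normalize(term), set()).add(concept_id)

    def resolve(self, term):
        # Returns a set, because one term could in principle name several concepts.
        return self._term_index.get(self._normalize(term), set())


store = ConceptStore()
adh = store.add_concept("EC 1.1.1.1", "Alcohol dehydrogenase", "Aldehyde reductase")
print(store.resolve("aldehyde  REDUCTASE") == {adh})
```

Adding a further synonym later only extends the index; the identifier returned by add_concept
stays the same.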
The ConceptWiki contains the biomedical terminology of the Unified Medical Language System
thesauri and concepts from SwissProt and Medline; in the near future the repository will be expanded
to incorporate the chemical terminology from ChemSpider for biologically relevant chemical molecules.
In this poster, we demonstrate the import process for another biologically relevant database, ENZYME
(Figure 2). For this import, the ENZYME flat file is first converted into XML. The import script
incorporates an XML parser and queries the ConceptWiki database to recover all concepts that match
those in the XML. For the ENZYME data, these are concepts containing an EC number as a synonym.
The stored ConceptWiki information is then compared to the ENZYME data. If the ENZYME data
is unknown to the ConceptWiki, it is inserted into the database. If the ENZYME data is found
but differs from the ConceptWiki, the stored information is updated. These changes are shown in the
interface: the ENZYME authority checkbox is removed if data are no longer supported by ENZYME, and
added if the data are new. The statement "EC 1.1.1.1 has synonym Aldehyde reductase" is an example
of ENZYME data generated, in XML, in the subject-predicate-object triple structure used by the
ConceptWiki backend. Each element is stored with a UUID.
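
The compare, insert and update steps of the import can be sketched in Python as follows. This is a
simplified illustration and not the actual import script: the function name sync_enzyme, the
dictionary layout, and the placeholder statement "Outdated name" are assumptions, and the authority
checkbox is modelled as membership of "ENZYME" in a set of authorities.

```python
def sync_enzyme(stored, incoming, authority="ENZYME"):
    """Reconcile stored statements with a fresh ENZYME release (illustrative only).

    stored:   {ec_number: {(predicate, value): set of authorities}}
    incoming: {ec_number: set of (predicate, value)}
    """
    for ec_number, statements in incoming.items():
        # Data unknown to the store are inserted.
        known = stored.setdefault(ec_number, {})
        # New or confirmed statements gain the ENZYME authority.
        for statement in statements:
            known.setdefault(statement, set()).add(authority)
        # Statements no longer supported by ENZYME lose that authority,
        # but stay in the store if another authority still backs them.
        for statement, authorities in known.items():
            if statement not in statements:
                authorities.discard(authority)
    return stored


stored = {
    "EC 1.1.1.1": {
        ("has synonym", "Alcohol dehydrogenase"): {"ENZYME"},
        # Made-up placeholder statement, present only to show a withdrawal.
        ("has synonym", "Outdated name"): {"ENZYME", "Community"},
    }
}
incoming = {
    "EC 1.1.1.1": {
        ("has synonym", "Alcohol dehydrogenase"),
        ("has synonym", "Aldehyde reductase"),
    }
}
sync_enzyme(stored, incoming)
for statement, authorities in sorted(stored["EC 1.1.1.1"].items()):
    print(statement, sorted(authorities))
```

Keeping a statement whose ENZYME authority has been withdrawn, rather than deleting it, is a design
choice of this sketch; it mirrors the description of the checkbox being removed in the interface.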

The RDF data model (Figure 3) is similar to classic conceptual modeling approaches, as it is
based upon the idea of making statements about resources, in particular Web resources, in the form
of subject-predicate-object expressions. The predicates of RDF triples are similar to hyperlinks;
however, the advantage of RDF triples over HTML hyperlinks is that the links are explicitly labeled.
The semantics of the relationship between the two entities is computationally accessible through
URI resolution. Of particular interest, the ConceptWiki predicate is represented in RDF using
the concept UUID. As would be expected, the subject and object are also represented in
RDF using the ConceptWiki UUIDs.


<rdf:RDF>
  <rdf:Description rdf:about="http://paul.nbiceng.net/index.php/Concept:58856720-61ee-11df…">
    <dcterms:Title>EC 1.1.1.1</dcterms:Title>
    <dcterms:identifier>58856720-61ee-11df-b0cb-001517ac506c</dcterms:identifier>
    <dcterms:alternative>Alcohol dehydrogenase</dcterms:alternative>
    <dcterms:alternative>Aldehyde reductase</dcterms:alternative>
    <dcterms:Type rdf:resource="http://paul.nbiceng.net/index.php/Concept:adcc36e6-5c2b-11df-b0cb-001517ac506c"/>
    <dcterms:Type rdf:resource="http://paul.nbiceng.net/index.php/Concept:add3fa02-5c2b-11df-b…"/>
    <cw:adef27fa-5c2b-11df-b0cb-001517ac506c rdf:resource="http://paul.nbiceng.net/index.php/Concept:0ac1bd20-5de0-11df-b0cb-…"/>
    <cw:ade2edb4-5c2b-11df-b0cb-001517ac506c>An alcohol + NAD(+) = an aldehyde or ketone + NADH</cw:ade2edb4-5c2b-11df-b0cb-001517ac506c>
    <cw:ade65b3e-5c2b-11df-b0cb-001517ac506c>Iron or zinc</cw:ade65b3e-5c2b-11df-b0cb-001517ac506c>
    <cw:ae2c5b7a-5c2b-11df-b0cb-001517ac506c>
      Acts on primary or secondary alcohols or hemiacetals.
    </cw:ae2c5b7a-5c2b-11df-b0cb-001517ac506c>
    <cw:ae2c5b7a-5c2b-11df-b0cb-001517ac506c>
      The animal, but not the yeast, enzyme acts also on cyclic secondary alcohols.
    </cw:ae2c5b7a-5c2b-11df-b0cb-001517ac506c>
  </rdf:Description>
</rdf:RDF>


Figure 3: RDF of ENZYME data 
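
To make the role of the UUIDs in each position of a triple concrete, the following Python sketch
writes one statement from Figure 3 as a single line. It uses the N-Triples serialization instead of
the RDF/XML of the figure, purely because it fits on one line. The base URI example.org and the
helper names concept_uri, literal and to_ntriple are placeholders invented for this sketch, not
part of the ConceptWiki.

```python
# Placeholder namespace for illustration; not the ConceptWiki's real URI scheme.
BASE = "http://example.org/concept/"


def concept_uri(concept_uuid):
    """Wrap a concept UUID in an N-Triples IRI reference."""
    return f"<{BASE}{concept_uuid}>"


def literal(text):
    """Format a plain string literal with the basic N-Triples escapes."""
    escaped = (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return f'"{escaped}"'


def to_ntriple(subject_uuid, predicate_uuid, obj, obj_is_concept=False):
    """Serialize one triple; subject and predicate are always concept UUIDs."""
    obj_term = concept_uri(obj) if obj_is_concept else literal(obj)
    return f"{concept_uri(subject_uuid)} {concept_uri(predicate_uuid)} {obj_term} ."


# UUIDs as read from Figure 3: the EC 1.1.1.1 concept and the predicate
# that the figure pairs with the value "Iron or zinc".
line = to_ntriple(
    "58856720-61ee-11df-b0cb-001517ac506c",
    "ade65b3e-5c2b-11df-b0cb-001517ac506c",
    "Iron or zinc",
)
print(line)
```

When the object is itself a concept, passing obj_is_concept=True emits a third URI in place of
the quoted literal.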


Using the concepts present in the ConceptWiki as a set of basic building blocks, new triples can
be assembled by end users via the interface. Through simple drop-down menus, users can establish
links between two records to illustrate pertinent connections. All newly created triples are
attributed to the participating scientist by showing their name and listing the triple on the page
that represents them. For example, the triple "Aldehyde reductase has function sorbitol biosynthetic
process" can be built using data included from the ENZYME import (Figure 4). Certain agencies have
indicated that they are interested in using the resulting contributions as indices of scholarly
achievement.